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FIGURE 9 

1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT 
51 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA 
101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 
1 5 1 TCTTAAGACT TGTGAAGAAA TTAAAG AAGC TAAAACGTTG TACTCTTGGA 
201 TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA TTATAAAGAT 
251 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCC ACG 
301 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT 
35 1 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT 
401 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 
451 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 
50 1 CTCAAAAGCA GTATTTCTTT C AAG ACGAAT GGAACCAAGT TAGAGCAATG 
55 1 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAAT ATTCCA 
60 1 GCGAACTTCT G AAGG AACTC TTGTTC AATT TTGCGGGAAT AACGTTTTTG 
651 ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 
701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA 
75 1 GAATGAGAAA GATCACTTTC TCAAC AAC AT CAACGTGCCG AATTGGAATA 
801 ATATGAAATC AAGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT 
85 1 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 
901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 
95 1 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG 
1 00 1 CTTGAGAAAG TCAAAGATTT TAACTTC AAC TACTATTTAA C AAAATCTTG 
1051 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 
1101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTT TAGC 
1151 TACACAACTG ATAATAAATG CGTCACAC AA TTTATTAATG AATTTTTCTA 
1 20 1 C AATATACTC CCCAAAG ACT TTTTGACTGG AAGAAACCGT AAGAATTTTC 
1251 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC 
1 30 1 AAAAACTTAT TGCTTGAGAA G ATCA ATACA AGAGAAATAT CATGGATGC A 
1351 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT 
1 40 1 ACGTCTTATG GAAATTGCTC CGATGGATAT TCGAGGATCT CGTCGTCTCG 
1451 CTGATTAGAT GATTTTTCTA TGTCACCG AG C AAC AG AAA A GTTACTCCAA 
1 501 AACCTATTAC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA 
1551 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 
1 60 1 GAAGAATGGA AAAAGTCGCT TGG ATTTGCA CCTGGAAAAC TCAGACTAAT 
1 65 1 ACCGAAGAAA ACTACTTTCC GTCCAATTAT GACTTTCAAT AAGAAGATTG 
1 701 TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG 
1 75 1 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC 
1801 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG 
1851 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 
1901 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 
1 95 1 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA 
2001 CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC 
2051 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 
2101 TGCACTTGAA GG AGG ACAAT ATCCAACCTT ATTCAGTGTT CTTG AAAATG 
2151 AACAAAATGA CTTAAATGCA AAGAAAACAT TAATTGTTGA AGCAAAGCAA 
220 1 AGAAATTATT TTAAG AA AG A TA ACTTACTT CAACC AGTCA TTAATATTTG 
225 1 CCAATATAAT TACATTAACT TTAATGGGAA GTTT TATAAA CAAACAAAAG 
2301 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 



FIGURE 9 (cont.) 



2351 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA 
2401 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 
2451 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 
2501 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 
2551 GACTAGTTTT CCATTAAGTC C AAGCAAATT TGCAAAATAC GGAATGGATA 
2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 
265 1 TCAATTG ATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT 
2701 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT 
2751 CAATGTGGCT C AAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC 
2801 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT 
285 1 CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGCCAAAG 
2901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC 
295 1 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA 
3001 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG 
305 1 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA 
3101 AAGTACATTT TCAACAGAGT TTGCATGATC CTCAAGGCAA AAGAAGCAAA 
3151 GCTAAAAAGT G ACC A ATGTC A ATCTCTAAT TC AATATGAT GC ATAGTCGA 
3201 CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG TCTTATATAC i 
3251 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 
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1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RNQSQSHYKD 
51 LEDIKIFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL 
1 0 1 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 
151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 
201 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTHFNRN 
251 NQFFKKHEFV SNKNNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKIAYM 
301 LEKVKDFNFN YYLTKSCPLP ENWRERKQKJ ENLINKTREE KSKYYEELFS 
351 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELEH 
401 KNLLLEKINT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWIFEDLWS 
451 LIRCFFYVTE QQKSYSKTYY YRKNIWDVIM KMSIADLKKE TLAEVQEKEV 
501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 
551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA 
601 TMDDEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNN1VIDSKN 
651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTLrVEAKQ 
701 RNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 
751 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 
801 INVSRENGFK FNMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI . 
851 SIDMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 
901 HYFRKTTTTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 
951 LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK 
1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A 



FIGURE 11 

1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG 
5 1 GTAGTTTAG A AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA 
101 TTTGGATGAT ATAGAAAATT TACTTCCTAA TACATTCAAC AAGTATAGCA 
151 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA 
201 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGT TCTAC TT 
25 1 CTCGGATGC A AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTnTAA 
301 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 
35 1 AATCAGGTAA TGAGGATTAT TCTA1 1 1 11 1 AGATCACTTC TTAAGGAGCA 
401 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 
451 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 
50 1 GATCTCG ATA C ATCAGACTT ACC AAAGACA AACTCGCTAT AAAACGCAAG 
551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 
601 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
65 1 TTGAGAC AAT TGAAAAAGCT GTTTACAACT GAAGGAATCG C AGTTCTGAA 
701 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 
£ 751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT 
80 1 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 
■ i 85 1 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 
: 1! 90 1 TTAC AAC AG A TTACCTGTTT TG ATTACTCT TGCTCATCTC TTATATCTTT 
t\ 951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 
' « 1001 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 
iij 1051 AAAGAAGAAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAAATTCC 

1101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTA TAG 1 1 1 1 lCATTT 
n U51 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 
il 1201 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 
L 1 25 1 TCAC ATTC AT AGATCGACCT TCATATATCC AATACGATGA TAAG GAAACA 
m 1301 GCAGTC ATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 

1351 AAG AAATGG A GCCG AAATCT TAATCAAAAA G AATTGCGTC G ATATTGCAA 
m 1401 AAGAATCGAA CTCTAAATCT TTCGTTAATA AGTATTACCA ATCTTGATTG 

1451 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 
1501 GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG 
1551 CGATCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 
1 60 1 ATACAAACCT TGGTC AAAAT ATTGAGGAAG G AAAAG AAG A CCAGTTAGCA 
165 1 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAG AAAT AA 
1701 AAGATTTATT 1111 I CAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 
1751 GGGGTTTTGGGG 
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CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 

♦ ♦ ♦ 60 

GGGGTTTTGGGGTTTTGGC&TTTTCCCG AT 

PQNPKTPKPL'KKKKLR 'F R 
PKTPKPQNPYKKR KNCGSLC 
PKPONPKTPI KKEK I E V V • K - 

AAT AAAAT ATT ATTCCC CC AC AAATGG AG ATGG AT ATTG ATTTGGATG AT AT AG AAAATT 

- ♦ * 120 

TTATTTTATAATAAGGCCCTGTTTACCTCTACCT^ 

NKILFPHKWRWILIW HI.'KI 
IKYYSRTNGDGYCFGC*Y.RKF- 
* N I I ? A OM C M D I D L D D I EN L - 

TACTTCCTAATACATTCAACAAGTATAGCAGCTCn^TAGTGACAAGAAAGGATGC 

♦ ♦ 180 

ATG AAGGATT ATGT AAG TTCTTC AT ATCGTCG AG AAC ATC ACTCTTCTTTC 

YFLIHSTS IAALVVTRKDAK 
TS-YIQQV-QLL- Q E R M Q N - 

LPNTFNKYSSSCS DKKOCKT- 

CATTCAAATCTGGCTCGAAATCGCCTTC ATTG ACT ATTCC AAACTTGCAAAA^ 

- 240 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACG1TTTTGTTAATC 

HCNLARNRLHC L F Q SCKNN 
IEIWLEIA FI DYSKVAKTIR- 
LKSGSKS PS LT I PKLQKQLE- 

ACTTCTACTTCTCCGATCCAAATCTTTATAACCATTCTT^ 

-- -- -- » 300 

TCAACATCAACACCCTACGTTTACAAATATTCCTAACAAACAACTC 

SSTSHMQI F I T I LSC EN • • p • 
VLL LCCKSL • R FF LEK I S F K - 
FYTSDANLYNDSr LRKLVLK- 

AAAGCC^AGAGCAAAGAGTAGAAATTGAAACATTACTAATCTTTAAATAAAATCAGCTAA 

360 

TTTCGCCTCTCCTTrCTCATCTTTAACTTTCT 

KAESKE " K.LK H Y ' C LNK I R 
KRRAKSSNCN 1TNV IKSGN - 

SOEQRVB I ET LLM T K N Q V H - 

TCAC^ATTATTCrATTTTTTACATCACTTCTT 

- 42C 

ACTCCTAATAAGATAAAAAATCTAGTCAAGAATTCCTCCTA^TACCTCTTTTAATGAATT 

CGLFYFLOHFLRS I M E K I T " 
E 0 Y S I F • ITS GALWRKLLN- 
RI ILFFRS LLKEH YG ENYLI- 

T ACT AAAAGGT AAAC AGTTTCG ATT ATTTCCCT AGCC AA C AATG ATC ACT AT ATT AAATT 

- -. 480 

ATCATTTTCCATTTGTCAAACCTAATAAATCCATCGCTTCTTACTACTCATATAATTT 

Y-KV NSLDYFPSOOCCVY-I 
TKR-TVW I ISLANNOEYIKF- 
LKGKOFGLFP " PTMMSI L N S - 



W FIGURE 12 (cont.) W 

CATATGAGAATCAGTCAAAGGATCTCGATACATCACACITACCAAAGACAAACTCCCTAT 

481 ♦ * * 540 

CTATACTCTTACrCACTTTCCTACACCTATGTAGT^ 

HMRMSQRISIHOTYQRQTRY 
ICECVKGSRY I RLTKOKLA I - 

Y eneskdldts dlpktnsl 

AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG 

541 - -"- " 600 

TTTTGCCTTCTTTTTCAAACrATTACC^ 

KTQEKVC-SNSRRTYCIYYS 
KRKKKFDNRTAEELIAFT I R- 
NARKSLIIEOOK NLLHL LFV- 

TATGCCTTTTATTACAATTGTTTTAGCTATCGAC^ 

601 " * * * 660 

ATACCCAAAATAA7CTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 

A YGFYYNCFRYRRCTPESCON - 

** * ________ . ~ » c* *r> t - 
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MG F I T IVLG I DC E L P S LET I 

c WV LLOLF-VSTVNSRVLROL- 

j„h TGAAAAAGCTGTTTACAACrCAACGAATCGCAGTTCTCAA^ 

661 * * * * * 720 

i») ACTTTTTCGACAAATGTTGACTTCCTTACCGTCAAGACTTTCAAGACTACACATACGGTA 

K :l a CKSCLQLKESQFCKF-CVCH - 

U! b ekAVYNCRNRSSESSDVYAI- 

g c KKLFTTEGIAVL K VLMCMPL- 

"""" 

III TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCT 

;s 721 - ♦ — * 780 

w ATAAAACACTTAATTAGAGTTTATAGAATAGAGTTAAATTACCTATCGATATCrri'GTrT 

LSI 



yfvn-sqisylnlmdsyrnk 

; b ILCINLKYLISI"WIAIETN 
;-| c FCEtlSNlLSOFNG'L-KOT 

^ CCAAATAAACCATCCAAGTTTAATGGAATATACGTTAAATCCTTTCGGACAAATGCACAC 

-l 78i * *m 

GCTTTATTTGGTACCTTCAAATTACCTTATATCCAATTTAGCAAACCCTCTTTACGTGTC 



:«, a PNXPCKrNGI YVKSFGTNAH 

b OI WHASLMEYTLNPLCOMHT- 

!|j c K - T >. 0 V ' w N 1 R • I L W O K C T L - 

TG AATTT AT ATTGG ATTC TT AAAGC AT AC AT AC AC AG AATGCTTT AG AG ACTG ATTT ACC 
841 9< 

acttaT-atat aacc taac.-.atttcctatctatctgtcttacgaaatctctgactaaatcc 
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d £ I Y 5CFLXHRYTECFROCFS 

b E * F i LOS * S I D T 0 AL E T D * - A 

c N L Y W ! L K A " I H R M L " R L I " L - 

TT AC AAC AG ATT ACCTGTTTTG ATT ACTCTTCCTC ATCTCTT AT ATCTTT AAAAG AAGC A 

901 " 9 * 

AATCTTCTC T A_ATGG AC AAAACT AATG AG AACG ACT AG AG AATAT AG AAA'rriTC TTCGT 

LOOiTCTDYSCSSLISLKEA 
b ysiRLPVLITLAHLLYL'KXQ- 
c TTOVLFC LLL L i SY I F X R S R - 

GGCGAAATGAAAACAACACTAAACAAACAGATTTCAAAATTTCTTGATTCTT 

961 ' " * 1( 

C C GCTTT ACTTTTC TTC TG ATTTCTTTCTCT AAAGTTTT AAAC AACT AAG AAG AC ATTGG 



G c M K R R L X K E I SK'FVDSSVT 
b A K C K £ 0 ' R K R F 0 N L L I L L ' P 

c . 



R N E X X T X ERO FX I CCFFCNR- 
GC AATT AAC AAC AAG AA T ATT AGC AACG AAAAAG AAG AAG ACCT ATC AC AATCCTG ATTC ^ ^ 
CCTTAATTGT TGTTCTT A 7 AA TC CTTGC TTTT T C TTCTTCTCG AT AGTGTT AGG ACT AAG 



c , Nf , XNISNEXSEELSOSCF 
b E " L T T R I LATK KXK SYHNPOS 

c K * 0 0 E Y " 0 R ^ RRRA ITILIL- 
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TTAAAGATTTCAAAAATTCCAGGTAAGACAGATACATTCATTAAAATTCATATATTATAC 

1081 «- — * ■— * 1140 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATCTAAGTAATTTTAA 

a LKISKIPGKRDTFIKIHI L - 

b .RFOKFQVREIHSLKFIYYS- 

c KDFXNSR*CRYIH*NSYI'IV- 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTT^ 

1141 * 1200 

AAAAAGT AAAGTGTCG AC AAT AAAAG AAAAT AG AATTGTT AT AAAAAACT AATCC ACCTT 



a FFISQLLFSFIL TIFFD'LE 

b FSFHSCYFLLS-QYFLISWK- 

c FHFTAVlFF-YLMMlFCLA.es- 



GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 
CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCC 



a VKSIK'EKR-TEVT-LIHIH 
b • KVSNKRSARLR * LSLFTF I 

c KKYOIREALDCGNLAYSHS' 



AGATCGACCTTCATATATCCAATACGATGATAAGGAAACACCACTCATCCCTTTTAAAAA 

^ ^ — — — — — — — ♦ — — — - — --♦—-——————-»-— - - — — — * 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGG AAAATTTTT 



a RSTFIYPIRC*CNSSHPF-K 

b DRPSY IQY DDKETAV I R FKN 

c IDLHISNTM1RKQ0SSVLK I - 



TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTT 
ATCACGATACTCCTGATTTAAAAATCTCAGTTCTT^ 



a . CYE D-IFRVKKWSRNLNOK 
b SAMRTKF LESRNGAEILIKK 

c VLCGLNF " SQEMEPKS " SKR 



GAATTCCCTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 
CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 



a ELR3YCKR IEL* IFR " ' V L P 

b NCVDIAKESMSKSFVNKYYO- 

c IASILQKNRT LNLSLISITN- 

ATCTTG A7TC A7TG AAG AG ATTC ACG AGGC AACTGC AC AG AAG ATC ATT AAAC AAAT AAA 

1441 " l * 00 

TAC AACTAACTAACTTCTCT AACTGCTCCGTTG ACCTGTCTTCT ACT AAT I'TC T TTATTT 

a ILIOCRO-RCMCTEOH-RMK 

b SCLIEE1 DEATAQK \ \ KEIK 

c LOCLXRLTROLHRRSLXK'S- 



GTAACTTTTATTAATTAGAGAATAAACTAAATTACTAATATAGAGATCAGCGATCTTCAA 
CATTCAAAATAATTAATCTCTTATTTGATTTAATGATTATATCTCTACTCGCTACAACTT 



a VTFiN'R I N. • ITN I Z I S D L Q 

b • L L L I R E • T K L L I • R S A £ F N 

c nfY-LENKLNY-YROORSSI 



TTG ACG AAAT AAAAGCTG AACT AAACTT AC AC AAT AAAAAAT AC AAAC CTTGCTC AAAAT 
AACTGCTTTATTTTCCACTTCATTTCAATCTGTTA 



a LTK'XLM-S-T'IKMTMLGOM 
b CRNXSCTKVRO-KIQTLVKI 
C DEIXAELKLDNKKYKPWSKY 



ATTG AGG AAGG AAAAC AAG AC C ACTT ACC AAAAG AAAAAAT AAC GC AAT AAAT AAAATC A 
T AACTCCTTCCTTTTCTT CTGGTC AATCGTTTTCTTTTTT ATTC C CTT ATTT ATTTT ACT 



a iceGXEDQLAKEXIRO-IXC 
b LRKSXKTS * 0 X X X • G M X * NE 

c CCRXRRPVSXRXMXAIMXMS 
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CTACACAACTCAACAAATAAAACATTTATTTTTTTCAATAATTTATTCAAAACAGGCCTT 

1681 * *- * * 1 

C ATCTC TTC ACTTCTTT ATTTTCT AAAT AAAAAAAGTTATTAAATAACTTTTCTCCCC AA 

a v qxCRNKRFIFFNNLLKRGV 

b YRSEEIKOLFFSIIYCKEC. F - 

c TEVKK-K IYF FO'FIEKRCF- 

TTCCCC TTTTCCGCTTTTGCCG 

1741 .-- ------ 1762 

AACCCCAAAACCCCAAAACCCC 

a LCFWCFG 

b wcrcvLC- 



c 
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FIGURE 13 



2 EVDV3N0 ADNHG I HS ALKTC EE I KEAKTLY SW I QKV IRCRNQSQSHY KOL 51 

19 ELELIMOENONDlWw- . I DDPKQY . LVNN/TAACLLQEGSYYODK 62 

52 EDIK I F AOTNIVATPRDYNEEDFKVIARKEVF . STGLMIELIDKCLVELL 100 

6 3 DEBS VI ITKALL. . . . EVAESDPEFICQLAVYIRN^YIRTTTNYI VAF 107 

101 SSSDVSDROKLOCFGFOLKGNOLAKTHLLTALSTQKOYFFOD^O^ 150 
. 08 cwHKNTOPFIEKYFNKAVLLPNDLLEVCEFAOVLYI 144 



151 ICNELFBHLYTKYLIFOBTSEGTLVOFCGNNVFDHLKVNDKFDKKOKGGA 200 
145 FDATEFKNLY [dbilsodirkeltfrkclqrcvrskf 181 



201 ADMNE. • PRCCSTCKYN\nWEKDHFLNNINVPNVWNMKSR 247 

18 2 SEFNEYOL.GKYCTES . . QRKKTMFRYLSVTNKOKWOOTKKK 220 

248 NRNNQ F FK KH EFVSNKNN I S AMDRAQT I FTN I FRFNR I RKKLKDKV I EK I 297 

221 ] *K&LLTKLQ*lKZSZDKSKnm DIMNVEDAIKALKPAVMKKI 264 

298 AYMLEKVKDFNFNYYLTKSCPLPENWRERK^^^ 347 

KHMKAPKIPNSTLESKYLTFKD 294 



265 AKRON AM K 
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FIGURE 19 



1 aactcattta attactaatl taatcaacaa gattgataaa aagcagtaaa taaaacccaa 
61 tagatttaat ttagaaagta tcaattgaaa aatggaaatt gaaaacaact aagcacaata 
121 gccaaaagcc gaaaaattgt ggtgggaact tgaattagag atgcaagaaa accaaaatga 
181 tatataagn agggttaaga ttgacgatcc taagcaatat ctcgtgaacg tcactgcagc 
24 1 atgtttgttg taggaaggta gttactacta agataaagat gaaagaagat atatcatcac 
301 taaagcactt cttgaggtgg ctgagtctga tcctgagttc atctgctagt tggcagtcta 
361 catccgtaat gaactttaca tcagaaciac cactaactac attgtagcat tttgtgttgt 
421 ccacaagaat actcaaccat tcatcgaaaa gtacttcaac aaagcagtac ttttgcctaa 
481 tgacttactg gaagtctgtg aatttgcata ggttctctat atttttgatg caactgaatt 
541 caaaaatttg tatcttgata ggatacttlc ataagatatt cgtaaggaac tcactttccg 
601 taagtgttla caaagatgcg tcagaagcaa gttttctgaa ttcaacgaat actaacttgg 
661 taagtattgc actgaatcct aacgtaagaa aacaatgttc cgttacctct cagttaccaa 
721 caagtaaaag tgggattaaa ctaagaagaa gagaaaagag aatctcttaa ccaaacttta 
781 ggcaataaag gaatctgaag ataagtccaa gagagaaact ggagacataa tgaacgttga 
841 agatgcaatc aaggctttaa aaccagcagt tatgaagaaa atagccaaga gatagaatgc 
901 catgaagaaa cacatgaagg cacctaaaat tcclaactct accttggaat caaagtactt 
961 gaccttcaag gatctcatta agttctgcca tatttctgag cctaaagaaa gagtctataa 
1021 gatccttggt aaaaaatacc ctaagaccga agaggaatac aaagcagcct ttggtgattc 
1081 tgcatctgca cccttcaatc ctgaattggc tggaaagcgt atgaagattg aaatctctaa 
1141 aacatgggaa aatgaactca gtgcaaaagg caacactgct gaggtttggg ataatttaat 
1201 ttcaagcaat taactcccat atatggccat gttacgtaac ttgtctaaca tcttaaaagc 
1261 cggtgtttca gatactacac actctattgt gatcaacaag atttgtgagc ccaaggccgt 
1321 tgagaactcc aagatgttcc ctcttcaatt ctttagtgcc attgaagctg ttaatgaagc 
1381 agttaclaag ggattcaagg ccaagaagag agaaaatatg aatcttaaag gtcaaatcga 
1441 agcagtaaag gaagttgttg aaaaaaccga tgaagagaag aaagatatgg agttggagta 
1501 aaccgaagaa ggagaatttg ttaaagtcaa cgaaggaatt ggcaagcaat acattaactc 
1561 cattgaactt gcaatcaaga tagcagttaa caagaattta gatgaaatca- aaggacacac 
1621 tgcaatcttc tctgatgttt ctggttctat gagtacctca atgtcaggtg gagccaagaa 
1681 gtatggncc gttcgtactt gtctcgagtg tgcattagtc cttggtttga tggtaaaata 
1 74 1 acgttgtgaa aagtcctcat tctacatctt cagttcacct agttctcaat gcaataagtg 
1801 ttacttagaa gngatctcc ctggagacga actccgtcct tctatgtaaa aacttttgca 
1861 agagaaagga aaacttggtg gtggtactga tttcccctat gagtgcattg atgaatggac 
1921 aaagaataaa actcacgtag acaatatcgt tattttgtct gatatgatga ttgcagaagg 
1981 atattcagat atcaatgtta gaggcagttc cattgttaac agcatcaaaa agtacaagga 
204 1 tgaagtaaat cctaacatta aaatctttgc agttgactta gaaggttacg gaaagtgcct 
2101 taatctaggt gatgagttca atgaaaacaa ctacatcaag atattcggta tgagcgattc 
2161 aatcnaaag ttcatttcag ccaagcaagg aggagcaaat atggtcgaag tlatcaaaaa 
2221 ctttgccctt caaaaaatag gacaaaagtg agtttcttga gattcttcta taacaaaaat 
2281 ctcaccccac ttttttgttt tattgcatag ccattatgaa atttaaatta ttatctattt 
2341 atttaagtta cttacatagt ttatgtatcg cagtctatta gcctattcaa atgattctgc 
2401 aaagaacaaa aaagattaaa a 
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MEIEhfNQAQQPKAEKLWWELELEMQENQNDIQVRVKIDDPKQYL 
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YIVAFCWHKNTQPFIEKYFNKAVLLPNDLLEVCEFAQVLYIFDATEFKNLYLDRILS 

QDIRKELTFPJCCLQRCVRSKJSEFTvJEYQLGKYCTESQRKKTMFRYLSVThnCQKWDQTK 

KKiUGENLLTKLQAlKESEDKSKRETGDIMNW 

APK1PNSTLESKYLTFKJDLIKFCH1SEPKJERVYKJLGKKYPKTEEEYKAAFGDSASAP 
FNPELAGKJIMKJEISKTWENELSAKGNTAEVWTD^ 

SDTTOSrVTNKJCEPKAVENSKMFPLQFFSAIEAVNEAVTKGFKAKKRENMN^ 

AVKJEVVEKTDEEKJCDMELEQTEEGEFVKVNEGIGKQYINSIELAIKJAVNKNLDEIKG 

HTAIFSDVSGSMSTSMSGGAKJCYGSVRTCLECALVLGLMVKQRCEKSSFYIFSSPSSQ 
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FIGURE 21 

1 tcaatactat taattaataa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 
61 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tattacaaaa 
121 tctagaagtt tacaaaagcc agattgagca ttataagacc tagtagtaat agatcaaaga 
181 ggaggatctc aagcttttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 
241 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 
301 gattaagtag caagtttaat tgataaaaaa agttggttct aaggtagaga aagatttgaa 
361 tttgaacgaa gatgaaaaca aaaagaatgg actttctgaa tagcaagtga aagaagagta 
421 attaagaacg attactgaag aataggttaa gtattaaaat ttagtattta acatggacta 
481 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga 
54 1 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 
601 ctaaaagaca tcatattgtt ggtggcttaa agattatttt aataaaaaca attatgatca 
661 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ttgatgattt 
721 ttcacaaaca atcaaactta ctaataattc ttactagact gttaacatag acgttaattt 
781 tgataataat ctctgtatac tcgcattgct tagattttta ttatcactag aaagattcaa 
841 tattttgaat ataagatctt cttatacaag aaattaatat aattttgaga aaattggtga 
901 gctacngaa actatcttcg cagttgtctt ttctcatcgc cacttacaag gcattcattt 
961 acaagncct tgcgaagcgt tctaalattt agttaactcc tcatcataaa ttagcgttaa 
1021 agaiagctaa ttataggtat actctttctc tacagactta aaattagttg acactaacaa 
1081 agtccaagat tattttaagt tctlataaga attccctcgt ttgactcatg taagctagta 
1141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 
1201 caagcatgct aatcttaatt tagtttctat ccctacctaa ttcaattttg atttctactt 
1261 tgttaattta taacatttga aattagagtt tggattagaa ccaaatattt tgacaaaaca 
1321 aaagcttgaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 
1381 aaacttttac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 
1441 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 
1501 aactccaagc gaaagcacaa gtggtatgaa attttttgat catctttctg aattaaccga 
1561 gcttgaagat ttcagcgtta acttgtaagc tacccaagaa atttatgata gcttgcacaa 
1621 acttttgatt agatcaacaa attlaaagaa gttcaaatta agttacaaat atgaaatgga 
1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 
174 1 taaaagatgc tctgttaata tatcaaatcc tcatggaaac atttcttatg aactgacaaa 
1801 taaagattct actttttata aatttaagct gaccttaaac taagaattat aacacgctaa 
1 86 1 gtatacmt aagtagaacg aattttaatt taataacgtt aaaagtgcaa aaattgaatc 
1921 ttcctcatta gaaagctlag aagatanga tagtctttgc aaatctattg cttcttgtaa 
1981 aaatrtacaa aatgttaata ttatcgccag tttgctctat cccaacaata tttagaaaaa 
2041 tcctttcaat aagcccaatc ttctattttt caagcaattt gaataattga aaaatttgga 
2101 aaatgtatct atcaactgta ttcttgatca gcatatactt aattctattt cagaattctt 
2161 agaaaagaat aaaaaaataa aagcattcat tttgaaaaga tattatttat tacaatatta 
2221 tcttgattat actaaattat ttaaaacact tcaatagtta cctgaattaa attaagttta 
2281 cattaattag caattagaag aangactgt gagtgaagta cataagtaag tatgggaaaa 
2341 ccacaagcaa-aaagctttct atgaaccatt atgtgagttt atcaaagaat catcctaaac 
2401 cctttagcta atagattttg accaaaacac tgtaagtgat gactctatta aaaagatttt 
2461 agaatctata tctgagtcta agtatcatca ttatttgaga ttgaacccta gttaatctag 
2521 cagtttaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 
2581 aggtgtttta gtaaaagcat actataaatt ccctctatgt ttaccaactg gtacttatta 
2641 cgattacaat tcagatagat gglgattaat taaatanag ttlaaalaaa tattaaatat 
2701 tgaatatttc tttgcttatt atttgaataa tacatacaat agtcattttt agtgttttga 
2761 atatatttta gttatttaat tcanatttt aagtaaataa ttatttttca atcatttttt 
2821 aaaaaaicg 
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MKILFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 
PNSRJGALPCLPGDLSHKAV1DHCIIYLLTGELYNNVLTFGYKIARNEDVNNSLFCHS 
ANVNVTLLKGAAWKJVIFHSLVGTYAFVDLLINYTVIQFNGQFFTQIVGNRCNEPHLPPK 
WVQRSSSSSATAAQIKQLTEPVTNKQFLHKLNINSSSFFPYSKILPSSSSIKXLTDLR 
EAIFPTNLVKIPQRLKVRINLTLQKXLKRHKJU.NYVSILNSICPPLEGTVLDLSHLSR 
QSPKERVLKFHVILQKLLPQEMFGSKKNKGKJIKNLNLLLSLPLNGYLPFDSLLKKL 
RLKDFRWLFISD1WTKHNFENLNQLAICFISWLFRQLIPKIIQTFFYCTEISSTVTI 

VYFRHDTWNKLITPFIVEYFKTYLVEhflWCRN 
t; IIAIPCRGADEEEFTIYKJENHKNAIQPTQKJLEYLPJ^KJ^TSFTKIYSPTQIADRIKE 
fi FKQRLLKKFN>TVLPELYFMKFDVKSCYDSIPRMECMRILKDALK>fENGFFVRSQYFF>l 
m TNTG VLKLFNWN AS R V P KP Y ELY IDhAf RTVHLSNQD VIN VVEMEIFKTAL WVEDKC Y 
S IREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPSQDTLILKLADDFLIISTDQQQV 
J;i INIKKLAMGGFQKYNAKANRDKJLAVSSQSDDDTVIQFCAMmFVKELEVWKHSSTMN 
Z NFH1RSKSSKGIFRSLIALFNTR1SYKTIDTNLNSTNTVLMQIDHVVKNISECYKSAF 
3 KDLSrNVTQNMQFHSFLQRIIEMTVSGCPlTKCDPLIEYEVRFTILNGFLESLSSNTS 

KFKJDNIILLRKEiQHLQAYI YIYIHIVN 




FIGURE 24 

Oxytricha LCVSY1LSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
Euplotes LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 
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FIGURE 25 ^ 



Motif 0 

AKFLHWLKS VY WELLRS FFY VTETTPQKNR 



Motif i 

^FRHDTVHKLITPFIVEYFXTYLVEHWCRH^ 

ty^yrkniwdvimkmsi-adlkketij^evqekevbewkks-lofapgklrlipkk ttf 



.vmoufor KKYF VR H>I KSCY DRI KQDLMFR I VKK-JCLKDPEPV I RK Y AT I HATS 
EBFVCK WKQVGQPKLFF ATMD I EKC YDSVNREKLSTFLKTTKLLSS DFW IMT AQ I L.KRKM 



FIGURE 26 



ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 

GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 

TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 

TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 

ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 

TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 

GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 

TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 

GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 

ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTGAATATAAATTCCTCTTC 

TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 

GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 

TAATTTCACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTAT^TT 

GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 

ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTA^^ 

AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 

AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 

^ATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 

GA^TCAATTGGCGATTTGTOTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTA 

AC^GACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 

TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 

CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 

^TGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 

a^gggcSgaagaagaa^ 

CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCGGACTAGTTTTACTAAAATATA 

??c?cc^a^aIatagctgaccgtatcaaagaatttaagcagagactttt^ 
c^acI^ggatg^ 

Stotogatctcaatatttcttcaataccaatacaggtgta™ 
tc?5aIcg??agcagagtaccaaaaccttatga^ 

tcat^atcaaatcaggatgttataaacgttgtagagatggaaatatttaaaacagcttt 



rAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 
AGAC^AACAGCAA^ 

TCrGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 
TATTCAAT^TCTGCAATGCACATATTTGTTAAAGAATTGG 



CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTA^ 

CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 



AGCC 

TTTTAAGGATCTATCAA^AATGTTA^ 
rATCA^TGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 

tgIgctIcg^tcaccatattc^ 

ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 



« % 
FIGURE 27 ^ 



AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIG1RQHLKR 
VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREICR 

AERLTSRVKALFSVLNYERA 



FIGURE 28 



GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 

TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC 

GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAA 

GAGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGA 

AGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACG 

GGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCG 

CAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGC 

GTGCTCAACTACGAGCGGGCGCG 



FIGURE 29 



MTEHHTPKSRJLRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTVVGF 

DSKPDEGVQFSSPKCSQSELIANVVKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLV 

STFPNYLISILESKNWQLLLE11GSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKNNVFEETVSICKRK 

RTIETSITQNKSARKEVSWNSISISRFSIFYRSSYKKFKQDLYFNLHSICDRNTVHMWLQWIFPRQFG 

LINAFQVKQLHKVIPLVSQSTVVPKRLLKVYPLIEQTAKRLHRJSLSKVYNHYCPY1DTHDDEK1LS 

YSLKPNQVFAFLRS1LVRVFPKLIWGNQRIFEI1LKDLETFLKLSRYESFSLHYLMSNIK1SEIEWLVL 

GKRSNAKMCLSDFEKRXQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIWKLLCRPFIT 

SMKMEAFEKINENKVRMDTQKTTLPPAVIRJ.LPKXNTFPJ.ITNLRKJU ? L1KMGSNKKMLVST>IQT 

LRPVASILKIILINEESSGIPF^LEVYMKLLTFKKDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLMFR 

IVKKKLKDPEFVIRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 

WTKSSSEIFKMLKEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 

LRVVDDFLFITWKXDAK^LNLSLRGFEKIWFSTSLEKTVINFENSNGIINNTFFNESKKRMPFFG 

FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFID1THNSKFNSCCNI 

YRLGYSMCMRAQAYLKRMKDIFIPQRMFITDLLNVIGRKIWKKLAEILGYTSRRFLSSAEVKWLFC 

LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRJAD 



W FIGURE 30 W 



Mtaccgamacmccracnc'ataagctaattgcttcctcgaacgctcctaaatctctggaaatamnacaagaactcaataacaataccaagtcaaanccaatatg^ 

tgtunagtgatcgataatamctatmatcggtcgttaw^^ 
ggncgcttactmaatcgtggtactgtmagctgctacttctagccaa^ 



ctgatgagactatattagancanacagiccgigcaiai"-ii<»a«-«»iB6 o B l '^ l " lv « ,v ""*6"'6-6 , ~-&— 1> — o—oo-o — » ■■» " 

aaaaggngataattamgcaaaatcatgtcctugtggtggtaatccgcgaaagtttmgatgcttgcacacgtctagcatgangagatanca^ 

ctccmaacgcggtttuttmctatmctanctcatgttgttccaaatatgtatcatctcgtatuggctttmcc 

aataatctaaanagmcgcttataangatagtagtagaaagattggtg^^ 

aaaatcctataanataaaunaatcaatamgcggtcactatttt^^ 

GACCGAACACCATACCCCCAAAAGCAGGATTCTTCGCTTTCTAGAGAATCAATATGTATACCTATGTA 
rCTTAAATGATTATGTACAACTTGTTTTGAGAGGGTCGCCGGCAAGCTCGTATAGCAATATATGCGAA 

cgc^Sagaagcgatgtacaaacgtccttttctatit^ 

AAGCCAGATGAAGGTGTTCAATTTTCTTCTCCAAAATGCTCACAGTCAGAGgtatatatattmgtTOgattttmctattcg 
^agT^ 

ATCTACTGATGAAAGGGTTTTCCATGgtaaggtattc^^ 

TCATGAAGATTTTCGAGCCATGCATGTAAACGGAGTACAAAATGATCTCGTTTCTACTTTTCCTAATTA 
rcriTATATXrrATACTTGAGTCAAAAAATTGGCAACTTTTGTTAGAAATgtaaa^ 

CA^ATgS 

CAAAAAAAAGAAAGCGAACCATTGAAACATCCATTACTCAAAATAAAAGCGCCCGCAAAGAAGTTTC 
rTGGAA^TAGCATTTCAATTAGTAGGTTTAGCATTTTTTACAGGTCATCCTATAAG 

tggc^caItc 
ag?gI££cacto^^^ 

acaaacagcaaagcgactccatcgtatttctctatcaaa 
Ja^awatg^tgaaaaaatccttagttato 

CATTCTTGTTCGAGTGTTTCCTAAATTAATCTGGGGTAACCAAAGGATATTTGAGATAATATTAAAAGg 
ATTATTTAATGAGTAACATAAAGgtaatatgccaaatttttttaccattaattaacaatcagATTTCAGAAATTGAAT 

ccttg<^aaaaggtcaaatgcgaaaatgtgcttaagtgattttgag 

AATTrATCTACTGGCTATACAATTCGTTTATAATACCTATTTTACAATCTTTTTTTTATATCACTGAATC 

aagTgat^acgaaatcgaactg^ 

TACATCAATGAAAATGGAAGCGTTTGAAAAAATAAACGAGgtannaaagUtttmgcaaaaagctaatatmcagAACAA 

tc™Satggatactcagaaaactactttgcctccagcagttattcgtctattacctaagaaga 

CTTTCGTCTCATTACGAATTTAAGAAAAAGATTCTTAATAAAGgtanaattm 
ca^GG^ 

AAACATTTAATCAATGAAGAAAGTAGTGGTATTCCATTTAACTTGGAGGTTTACATGAAGCTrc^ 
TTTAAGAAGGATCTTCTTAAGCACCGAATGTTTGGgtaattatataatgcgcgattcctcattattaatmgcagGCGTAAG^ 

tTtt^gtacggatagatataaaatcctgttatgatcgaat^ 
Ia™aa C ct^ 

AGCTACAAAAAACTTTGTTAGTGAGGCGTTTTCCTATTgtaagmatttmcattggaatttmaacaaanctttt^ 
rrT^CTTTTGAAAAAGTCGTGCAGTTACT^ 

ggI^aTSaccIaaag^ctg 

?AASutIIca^g^ 

?a?gaIaaagI?^ 

AGGATGCAAAAAAATTmGAATTTATCTTTAAGAGgtgagttgctgtca^ 
ACAC^A^CTACGAGCCTGGAGAAAACAGTAATAAAC^^ 

^c^^aatgaaagcaagaaaagaatgcca^^ 
ataca^gttagcatgtcctaaaattgatga 

ATATGGGGAAATCTTTTTTTTACAAAATTCTAAGgtatactgtgtaactgaataatagctgacaaataatcagATC^ 



FIGURE 30 (cont.) 



ATGetacEtRtcggtctcgagacttcagcaatattgacacatcagGCTTTTTTGTCTTGGAATGAG 

TAAGACCAGTTTTGCGACAGGTGTTATTTTTACATAGAAGAATAGCTGATTAAtglcamicaatnanatatacatcctt 
ta „actggtgtcttaaacaatattattacttagtatag^ 



aco 



faScac^ 



FIGURE 31 
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EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



^EST2 pep 
OEuplotes pep 
UJVrans of tetrahymen 

^Consensus 



! ^EST2 pep 
^Euplotes pep 
LflTrans of tetrahymen 
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FFYCTEISST VTXVYFRHDT WN KLIT I FIVE YFK-TYLVEN 

FFYVTEQOKS Y2SKTYYYRKN IWDVI-MKMS IAD LKK ETLA—EVQE 
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KIQL&EENLE KVEEKLIPED SFQKY 

K. . . E F 



GFAPCK^fiLlij 



rtttr. 



fpi 



MPCRG 
tfTFNKK 
p^HlbTTFLRK 

IE3HIMTF . RK 



ADEEBFTIYK ENHKNAIQ^KILE^hW FlPTSFTKIYS WQIJDRJKE 

rVNSDRKTTK LTTOTM^^UlIJCto RMFK -J rGFAV^ 

DKQKNIK L^Il^SLVFB^Cfi-- ML-G -Q.<IGYSVFD 

- . . IG . . YF . 



FKQRLLN 
YD-DVMK 



FN NVL 

\fE EFVCKWKQVG C 
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S-l: FFY VTE TTF QKN RLF FYR KS V WSK 
S-2: ROHLKRVQLRDVSEAEVRQHREA 
S-3: ART FRR EKR AER LTS RVK ALF SVL NYE 

A-l: AKFLHWLMS VYV VEL LRS FFY VTE TTF Q . 
A-2: LFF YRK SVW SKL QSI GJR QHL KRV QLR DVS 
A-3: PAL LTS RLR FIP KPD GLR PIV NMD YW 
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Vector Genomic DNA Insert 



Vector 



ill 
'Si 

m 



AS 



B2 



5825 bp Sequenced 
-2 kb Hind III Fragment 




t kb 



-J 



CO 
o 
ru 



RT Motifs 12 3(A) 4(B*) 5(Cr6(D) 
i / \ \ 



m mw^m^\m.mn mm mm 



Introns 



2 I 



<:DNA 
5-2d 



cDNA 



6 7 8 9 10 11 




RT-PCR w/ M2-B14 
RT-PCR w/ M2-B15 Bdnc 
RT-PCR w/ M2-B15 Be nc 
RT-PCR w/ M2-B16Bcnc 



Hind III 



500 bp 
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ictJ 

in 



Poly 4 

t t c 

t a a g c c teg 
5'- cag acc aaa gga att cca taa gg -3' 
QTKGIPQG 



4(B') 



5(C) 

D D Y L L I T 

3'- ctg ctg atg gag gag tag tgg -5 
a a aaaaa a a 
t t t t 
c c 
Polv 1 




Motif B' (4) 

QTKGIPQG 



Motif C (5) 
DDYLLIT 



FIGURE 36 fp 



PCR Product M2 showed Reasonable Match 
with Other Telomerase Proteins 

ot LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 

Ea d123 kgipqglcvssilssfyyatleesslgflrdesmnpenpnvnllmrltddyli.it 

St) M2 SILSSFLCHFYMEDLIDEYLSFTKKK GSVLLRW 

SCJ>103 DGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS QDTLILKLADDFLIIS 



q L"k v g i p Q g 

cac&aa get ggt acc ccc cag gg <— Actual Genomic Sequence. 



% t t c 

ta!fa g c c teg 

ca#Macc aaa gga att cca taa gg 

111 



ag acc aaa gga att cca tea ggC TCA ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG 
tcOtgg ttt cct taa ggt agt ccG AGT TAA GAC AGT AGA AAA AAC ACA GTA AAG ATA TAC 



M K G I P S 



M 



GAa' GAT TTG ATT GAT GAA TAC CTA TCG TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA 
CTT CTA AAC TAA CTA CTT ATG GAT AGC AAA TGC TTT TTC TTT CCT AGT CAC AAC AAT GCT 



GTA GTC gac gac tac etc etc ate acc 
CAT CAG ctg ctg atg gag gag tag tgg 



ctg ctg atg gag gag tag tgg 
a a aaaaa a a 
t t t t 
c c 

Polv 1 



K K 



gac gat 
D D 



ttc etc ttt ata aca 
F L F I T 



< Actual Genomic Sequence. 
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3* RT PCR Strategy 




AAAAAAAAAAAAAAAA 
TTTTTTTTTTTTTT1 



3* 



1. Synthesis of cDNA wtth Qt Primer. 




4. Sequence Second Round PCR Products Using Inside Primer or Q, Primer. 
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-Size Selected Libraries from P. Nurese 
3 - 4 kb 
5-6kb 
7 - 8kb 
11 - 12 kb 

-Libraries from J. A. Wise 
Sau 3a Partial Digest 
Hind III Partial Digest 

cDNA Libraries 

GAD (Gal Activation Domain) Library 
k , REP Library from R. Allshire 

□ REP81ES Library (old) 

□ REP81ES Library (new) 
REP41ES Library 
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Original PCR 1 
RT-PCR 






rl 1 


500 bp 




Mg Cone 
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310- 
234- 
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FIGURE 39 



A5 B2 




Hind III Digested Positive Genomic Clones 



% % 
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5' RT PCR Strategy 



ddA- 



mRNA 



-P 



I AAAAAAAAAAAAAAAA . 



1 . Synthesis of cDN A with Specific Downstream Primer. 

mRNA 



5' 
3' 



I AAAAAAAAAAAAAAAA . 



jugate Oligo with 5'-P and blocked 3* to cDNA using U RNA Ugase. 



1 First Round PCR 

CI 

m 

it** 



4. Second Round PCR 
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Alignment of RT Domains from Telomerase Catalytic Subunits. 

Motif 0 

Tezlp (429). WLYNSFI IPILQSFFYITES5DLRNRTVYFRKDIW ...(35)... 
Est2p (366). WLFRQLI PKI IQTFFYCTEISSTVT- IVYFRHDTW ...(35)... 
pl23 (441). WIFEDLWSLIRCFFYVTEQQKSYSKTYYYRKNIW ...(35)... 
* * *** ** * * * 

Motif 1 Motif 2 K 
p hh h K hR h R 
Tezlp AVIRLLPKK--NTFRLITN-LRKRF ...(61)... 
Est2p SKMRI I PKKSNNEFRI I AI PCRGAD ...(62)... 
pl23 GKLRL I PKK- -TTFRPIMTFNKKIV ...(61)... 

* * * * * * * 

Motif 3(A) AF 

h hDh GY h 
Tezlp KKYFVRIDIKSCYDRIKQDLMFRIVK ... (89) . . . 
Est2p ELY FMKFDVKSCYDS I PPvMECMR I LK ...(75)... 
pl23 KLFFATMD I EKCYD3VNREKLSTF LK ...(107)... 

* * *** * 

Motif 4 (B' ) 

hPQG pP hh h 
Tezlp YLQKVGIPQGSILS3FLCHFYMEDLIDEYLSF ...(6)... 
Est2p YIREDGLFQGSSLSAPIVDLVYDDLLEFYSEF . . . (8) . . . 
pl23 YKQTKG I PQGLCV S S I L S S F YY'ATLE ESS LGF ...(14)... 
***** * * 

Y Motif 5(C) Motif 6(D) 

h F DDhhh Gh h cK h 

Tezlp vr.LPVVDDFLFITVNKKDAKKFLNLSLRGFEKHNFSTSLEKTVINFENS . (205) 
Est2p LI LKLADDFLI ISTDQQQVINIKKLAMGGFQKYTJAKAr-lRDKILAVSSQS . (173) 
pl23 LLMP.LTDDYLL I TTQENNAVLF I EKLINVSRENGF K FNMKKLQTSF PLS .(209) 



% 
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^ FIGURE 43 ' w 

Disruption strategy for the putative telomerase genes. 



wild type telomerase gene 



wild type telomerase gene 



1 . Transform with linear fragment 
containing the telomerase gene 
disrupted with a LEU2 or ura4 marker. 
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telomerase gene::M 
wild type telomerase gene 




2. Assay in selective media. 

3. Sporulate. and grow on 
selective media. 
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wild type telomerase gene 
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telomerase gene::M 



Wild Type (X2) 



Senescence? (X2) 

(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 
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An Example of Confirmation of tezl disruption By PCR 



1500 bp 



600 bp 



100 bp 



0 

i+ ** 
So g s 

H N N N N N 

Q <p o o © 


o 

3 o IS 

^3 3 > 

73 73 73 73 73 £f 

© O w O V 


o 

go 2 « 

C 3 3 ^ 
+ :: + + ii £ 

73 73 73 "n 73 y 

O j> <D <D <P <P 


o 

O O O <D 

3. ' 3 > 
+ :: ♦ ♦ :: + 

73 73 73 73 73 w 
<d ^ © © © © • 










M2-T8 
& 

ura-T1572 
Xca 1 N^2 


M2-T6 
& 

ura-T1572 
-T2 -► 


M2-B13 
& 

ura-665 
Xca 


| M2-T2 
& 

1 M2-B3 

1 




. M2-B3 



M2-T6 




ura-T1572 



+pDJn 
ura-B85 



i % 

W FIGURE 45 W 



fez J disruption causes progressive 
shortening of telomeres in S. pombe 
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FIGURE 46 



^ ,^^^ ac tttcctttcttcataa g ctaattgcttcctcgaacgctcctaaatctctggaaatacttttacaaga 80 

61 ccaagtaLaggacaaaLgaacaacttccttccccctaaagacttttactttattaatttacttttcaaatat^ 240 
^ttcacttac?tttaatcgtggtactgttttagctgctacttctagccaaccgcgtgtttctaccccgtcattggatat 320 

111 t^acatggagccttacactttagatgagtcacgtcgcatgatggagtatttggtatcatccaacgtttgccttgaaaag 480 
111 lll^ Ho 

881 Sattta^ 958 

959 ATG A CC GAA CAC CAT ACC CCC AAA AGC AGO ATT CTT CGC TTT CTA GAG AAT CAA TAT GTA 
1M, TEHHTPKSRILRFLE NW 

1 c 1 4|,cc™ TCT -™ rrrrrrrrrrr cc= rf c r 10 

2 1; ?y L C T L N U i v y " 

107JU AGC AAT ATA TGC GAA CGC TTG AGA AGC GAT GTA CAA ACG TCC TTT TCT ATT TTT CTT 11 
4ify SMICERLRSDV.QTSFSI* 

1139 3 iAT TCG ACT GTA GTC GGC TTC GAC AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 



1018 
20 

1078 



1138 



61HH S TVVor~~K 



1198 
80 



FIGURE 46 (cont.) 

1199 AAA TGC TCA CAG TCA GAG gtatatatatttttgttttgatttttt tctattcgggatagctaatatatgggcag 1272 

81 K C S 0 S E 86 

1273 CTA ATA GCG AAT GTT GTA AAA CAG ATG TTC GAT GAA AGT TTT GAG CGT CGA AGG AAT CTA 1332 

37 L I ANVVKQM F D E S F ER R R N L 106 

1333 CTG ATG AAA GGG TTT TCC ATG gtaaggtattctaattgtgaaatat ttacc tgcaattactgt t tcaaagaga 1405 

107 L M K G F S M 113 

1406 ttgtatttaaccgataaag AAT CAT GAA GAT TTT CGA GCC ATG CAT GTA AAC GGA GTA CAA AAT 1469 

114 NHED F RAM H VNGVQ N 128 

1470 GAT CTC GTT TCT ACT TTT CCT AAT TAC CTT ATA TCT ATA CTT GAG TCA AAA AAT TGG CAA 1529 

129 DLVSTFPNYLISI LESKNWQ 148 

1530 CTT TTG TTA GAA AT gtaaa taccggttaagatgttgcgcactttgaacaagactgacaagtatag T ATC GGC 1601 

149UU L L E I I G 155 

1602^AGT GAT GCC ATG CAT TAC TTA TTA TCC AAA GGA AGT ATT TTT GAG GCT CTT CCA AAT GAC 1661 

156|li DAMH YLLS K G S I F EAL P N D 175 

1662"rAAT TAC CTT CAG ATT TCT GGC ATA CCA CTT TTT AAA AAT AAT GTG TTT GAG GAA ACT GTG 1721 

176|Jfl YL QISGI P L FKNNVFEETV 195 

1722 !li k:A AAA AAA AGA AAG CGA ACC ATT GAA ACA TCC ATT ACT CAA AAT AAA AGC GCC CGC AAA 1781 

196" S KKRKRTI ETS I T QNKSARK 215 

Q 

1782HSAA GTT TCC TGG AAT AGC ATT TCA ATT AGT AGG TTT AGC ATT TTT TAC AGG TCA TCC TAT 1841 

216HE VSWNS ISISRFSI FYRSSY 235 

1842L!XaG AAG TTT AAG CAA G gtaactaatactgttatccttca taactaatt ttag AT CTA TAT TTT AAC 
1907m 

236 KKFKQD LYFN 245 

1908 TTA CAC TCT ATT TGT GAT CGG AAC ACA GTA CAC ATG TGG CTT CAA TGG ATT TTT CCA AGG 1967 

246 LHSICDRNTVHMWLQWI F PR 265 

1968 CAA TTT GGA CTT ATA AAC GCA TTT CAA GTG AAG CAA TTG CAC AAA GTG ATT CCA CTG GTA 2027 

266 QFG LIN AFQVKQLHKVI PLV 285 

2028 TCA CAG AGT ACA GTT GTG CCC AAA CGT CTC CTA AAG GTA TAC CCT TTA ATT GAA CAA ACA 2087 

286 SQSTVVPKRLLKVYPLIEQT 305 

2088 GCA AAG CGA CTC CAT CGT ATT TCT CTA TCA AAA GTT TAC AAC CAT TAT TGC CCA TAT ATT 2147 

306 AKRLHRISLSKVYNHYCPYI 325 

2148 GAC ACC CAC GAT GAT GAA AAA ATC CTT AGT TAT TCC TTA AAG CCG AAC CAG GTG TTT GCG 2207 

326 DTHDDEKILSYSLKPNQVFA 345 

2208 TTT CTT CGA TCC ATT CTT GTT CGA GTG TTT CCT AAA TTA ATC TGG GGT AAC CAA AGG ATA 2267 

346 FLRSILVRVFPKLIWGNQR I 365 



2268 TTT GAG ATA ATA TTA AAA G gtattgtataaaatttattaccactaacgatt ttaccag AC CTC GAA ACT 2336 
366 F E I I L K D L E T 375 
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376 F L» K L S K i * 

2397 gtaatatgccaaacctttttaccattaatt 
396 

2466 AAA AGG TCA AAT GCG AAA ATG TC 

406 KRSNAKMC 

2526 GAA TTC ATC TAC TGG CTA TAC A/ 
426 E F I Y W L Y N 
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FIGURE 46 (cont.) 

3533 c.aa.gaaactag ATA CCA AAT TCT CAA TAC CTT CAA AAA CTT GGT ATC OCT CAC GGC TCA 3593 



693 IGNSQYLQK 



3,94 ATT CTG TCA TCT TTT TTC TCT CAT TTC TAT ATG GAA GAT TTG ATT GAT CAA TAC CTA TCG 3653 
ll\ I Ij S SFLCH F Y M EDL I D EY L S 728 

3654 TTT ACG AAA AAG AAA CCA TCA CTG TTC TTA CCA CTA GTC CAC GAT TTC CTC TTT ATA ACA 3713 



3714 CTT AAT AAA AAG GAT CCA AAA AAA TTT TTG AAT TTA TCT TTA ACA G gtgagttgctgtcattcc 3777 
749 VN KKDAKKF LNLSLRG 



3840 
778 



rj . rpqyji at , G C AC AAT TTT TCT ACG AGC CTG GAG AAA ACA GTA 

3778 caagttctaaccgttgaag GA TTT GAt> aaa ami 
765 FE KHNFSTSLEKTV 

3841 ATA AAC TTT GAA AAT ACT AAT CCC ATA ATA AAC AAT ACT TTT TTT AAT CAA AGC AAG AAA 3900 
_ cm cisir T I N NT F FN E S K K *y° 
77 9 I N F ,E N S N O l J- w 

390|::AGA ATG CCA TTC TTC GGT TTC TCT GTC AAC ATG AGC TCT CTT GAT ACA TTC TTA CCA TCT 3960 
79^R MPFFGFSVNMRSLDTLLAC 

396fflcCT AAA ATT CAT GAA GCC TTA TTT AAC TCT ACA TCT GTA CAC CTG ACG AAA CAT ATG GCG 4020 
81^P KlDEALFNSTSVb 



402|AAA TCT TTT TTT TAC AAA ATT CTA AG gtatactgtgtaactgaataatagctgacaaataatcag A TCG 4089 
83SM K SFFYKILR 



4 099 AGC CTT CCA TCC TTT CCA CAA CTA TTT ATT CAC ATT ACC CAC AAT TCA AAA TTC AAT TCT 4149 

4J0TGC TGC AAT ATA TAT ACC CTA CCA TAC TCT ATG TGT ATG ACA GCA CAA CCA TAC TTA AAA 4209 
8691 C CNIYRLGYSMCMR AQA. 

4210 ACG ATG AAG GAT ATA TTT ATT CCC CAA ACA ATG TTC ATA ACG G gtgagtacttattttaactaga 4274 
889 R M K D I F I P Q R " F I T D 

4275 aaagtcattaattaaccttag AT CTT TTC AAT CTT ATT GGA ACA AAA ATT TGC AAA AAG TTG CCC 4339 
904 L L N V I G R 

4340 CAA ATA TTA GGA TAT ACC ACT ACG CGT TTC TTC TCC TCT GCA GAA GTC AAA TG gtacgtgtc 4401 
918EILGYTSRRFLSSAEVKW 

P ^ t ««r af r a( i G CTT TTT TGT CTT GGA ATG AGA GAT GGT TTG AAA 4468 
4402 ggtctcgagacttcagcaatattgacacatcag G ctt tti 1 1^1 u ^ ^ ^ ^ ^ g46 

936 L 

4469 CCC TCT TTC AAA TAT CAT CCA TCC TTC CAA CAC CTA ATA TAC CAA TTT CAC TCA TTG ACT 4528 
947 P S F K Y H P C F E Q L I Y Q F Q S L T 

4529 CAT CTT ATC AAG CCC CTA ACA CCA CTT TTC CCA CAC GTC TTA TTT TTA CAT ACA ACA ATA 4588 
967 D L I K P L R P V L R Q V L F L H R R I 

4589 GCT GAT TAA tgtcattttcaatttattatatacatcctttattactggtgtcttaaacaatattattactaagtata 4665 



987 A 



v FIGURE 46 (cont.) » 



4666 gctgacccccaaagcaagcatactataggatttctagtaaagtaaaattaatctcgttattagttttgattgacttgtct 4745. 

4746 ttatccttatacttttaagaaagattgacagtggttgctgactactgcccacatgcccaccaaacgggagtggttaaaca 4825 

4826 ttaaaagtaatacatgaggctaatctcctttcatttagaataaggaaagtggttttctataatgaataatgcccgcacta 4905 

4906 atgcaaaaagacgaagattaccttctaaacaagggggattaagcatatccgaaggaaaagagagtaatatacccagtgtt 4985 

4986 gttgaagaaagcaaggataatttggaacaagcttctgcagatgacaggctaaattttggtgaccgaattttggtaaaagc 5065 

5066 cccaggttatccacggtggccggccttgctactgagacgaaaagaaactaaggatagtttgaatactaatagctcattta 5145 

5146 atgtcttatataaggttttgttttttcctgacttcaattttgcatgggtgaaaagaaatagtgttaagccattattggat 5225 

5226 tccgaaatagccaaatttcttggttcctcaaagcggaagtctaaagaacttattgaagcttatgaggcctcaaaaactcc 5305 

5306 tcctgatttaaaggaggaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaattttttgc 5385 

5386 aaaaaagaaaataccattgggagacatctcttgatgaatcagatgcggagagtatctccagcggatccttgatgtcaata 5465 

5466 acttctatttctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagttaagtgaccaaaggtacc 5544 
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l 

met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAC GTC GTC GAG CTG CTC 

10 20 
arg ser phe phe tyr val thr glu thr thr phe gin lys asn arg 
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gin ser lie 
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 
qly ile arg gin his leu lys arg' val gin leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

60 

qlu ala glu val arg gin his arg glu ala arg pro ala leu leu 
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70 80 , , 

thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 

qlu lys ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

120 

ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gin asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 

160 170 

phe val lys val asp val thr gly ala tyr asp thr ile pro gin 

TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 

180 

asp arq leu thr glu val ile ala ser ile ile lys pro gin asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 
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190 

thr tyr cys val arg 
ACG TAC TGC GTG CGT 



gly thr ser ala arg 
GGC ACG TCC GCA AGG 

220 

gin gly ile pro gin 
CAG GGG ATC CCG CAG 



leu cys tyr gly asp 
CTG TGC TAC GGC GAC 

250 

arg asp gly leu leu 
CGG GAC GGG CTG CTC 



thr pro his leu thr 
ACA CCT CAC CTC ACC 

280 

arg gly val pro glu 
CGA GGT GTC CCT GAG 



val val asn phe pro 
GTG GTG AAC TTC CCT 

310 

phe val gin met pro. 
TTT GTT CAG ATG CCG 



leu leu asp thr arg 
CTG CTG GAT ACC CGG 

340 

tyr ala arg thr ser 
TAT GCC CGG ACC TCC 



phe lys ala gly arg 
TTC AAG GCT GGG AGG 

370 

arg leu lys cys his 
CGG CTG AAG TGT CAC 



arg tyr ala val val 
CGG TAT GCC GTG GTC 

210 

pro ser arg ala thr 
CCT TCA AGA GCC ACG 



gly ser ile leu ser 
GGC TCC ATC CTC TCC 

240 

met glu asn lys leu 
ATG GAG AAC AAG CTG 



leu arg leu val asp 
CTG CGT TTG GTG GAT 

270 

his ala lys thr phe 
CAC GCG AAA ACC TTC 



tyr gly cys val val 
TAT GGC TGC GTG GTG 

300 

val glu asp glu ala 
GTA GAA GAC GAG GCC 



ala his gly leu phe 
GCC CAC GGC CTA TTC 

330 

thr leu glu val gin 
ACC CTG GAG GTG CAG 



ile arg ala ser leu 
ATC AGA GCC AGT CTC 

360 

asn met arg arg lys 
AAC ATG CGT CGC AAA 



ser leu phe leu asp 
AGC CTG TTT CTG GAT 



200 

gin lys ala ala met 
CAG AAG GCC GCC ATG 



ser tyr val gin cys 
TCC TAC GTC CAG TGC 

230 

thr leu leu cys ser 
ACG CTG CTC TGC AGC 



phe ala gly ile arg 
TTT GCG GGG ATT" CGG 

260 

asp phe leu leu val 
GAT TTC TTG TTG GTG 



leu arg thr leu val 
CTC AGG ACC CTG GTC 

290 

asn leu arg lys thr 
AAC TTG CGG AAG ACA 



leu gly gly thr ala 
CTG GGT GGC ACG GCT 

320 

pro trp cys gly leu 
CCC TGG TGC GGC CTG 



ser asp tyr ser ser 
AGC GAC TAC TCC AGC 

350 

thr phe asn arg gly 
ACC TTC AAC CGC GGC 



leu phe gly val leu 
CTC TTT GGG GTC TTG 

380 

leu gin val asn ser 
TTG CAG GTG AAC AGC 
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390 

leu gin thr val cys thr asn ile tyr lys ile leu leu leu gin 
CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG CTG CAG 

400 410 

ala tyr arg phe his ala cys val leu gin leu pro phe his gin 
GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG 

420 

qln val trp lys asn pro his phe ser cys ala ser ser leu thr 
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 440 

arg leu pro leu leu leu his pro glu ser gin glu arg arg asp 
CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG .GAT 

450 

« val ala gly gly gin gly arg arg arg pro ser ala leu arg gly 
S GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

? 460 470 

% arq ala val ala val pro pro ser ile pro ala gin ala asp ser 
% CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

i.H 480 

iS thr pro cys his leu arg ala thr pro gly val thr gin asp ser 
U ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

|s=i> 

|»L 4 90 500 

iM pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 

E» CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 



510 

pro qly gly arg ser gin pro gly thr ala leu arg leu gin asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 , , . 

his pro gly leu met ala thr arg pro gin pro gly arg glu gin 
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gin gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 , . ' 

arq qly qly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 



CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 



% 
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AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 
CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 
TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 
AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 
GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 

TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
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1 GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC 
5 1 CCGCGATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 
AGCCACTACC GCGAGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 
GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 
CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN ANGGCNGCCC 
CCCGCCGCCCCCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 
CCGAGTGCTG CANANGCTGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT 
TCGGCTTCGC GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC 
ACCACCAGCG TGCGCAGCTA CCTGCCCAAC ACGGTGACCG ACGCACTGCG 
GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC 
TGGTTCACCT GCTGGCACGC TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC 
TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 
TCAGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC 
CAACGGGCCT GGAACCATAG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG 
CCAGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 
GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 
CCGTTGGGCA GGGGTCCTGG GCCCACCCGG GCAGGACGCC TGGACCGAGT 
GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAGCCAC 
CTCTTTGGAG GGTGCGCTCT CTGGCACGCG CCACTCCCAC CCATCCGTGG 
GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCT 
GGGACACGCC TTGTCCCCCG GTGTACGCCG AGACCAAGCA CTTCCTCTAC 
TCCTCAGGCG ACAAGNACAC TGCGNCCCTC CTTCCTACTC AATATATCTG 
AGGCCCAGCC TGACTGGCGT TCGGGAGGTT CGTGGAGACA NTCTTTCTGG 
TTCCAGGCCT TGGATGCGAG GATTCCCCGC AGGTTGCCCC GCCTGCCCCA 
GCGNTACTGG CAAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACCACG 
CGCAGTGCGC CTACGGGGTG TTCCTCAAGA CGCACTGCCC GCTGCGAGCT 
GCGGTC ACCC CAGC AGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 
TGTGGCGGCC CCCGAGGAGG AGGAACACAG ACCCCCGTCG CCTGGTGCAG 
CTGCTCCGCC AGCACAGCAG CCCCTGGCAG GTGTACGGCT TCGTGCGGGC 
CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG 
AACGCCGCTT CCTCAGGAAC ACCAAGAAGT TCATCTCCCT GGGGAAGCAT 
GCCAAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG 
CGCTTGGCTG CGCAGGAGCC CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC 
ACCGTCTGCG TGAGGAGATC CTGGCCAAGT TCCTGCACTG GCTGATGAGT 
GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA CGGAGACCAC 
GTTTCAAAAG AACAGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCAAGT 
TGCAAAGCAT TGGAATCAGA CAGCACTTGA AGAGGGTGCA GCTGCGGGAG 
CTGTCGGAAG CAGAGGTCAG GCAGC ATCGG GAAGCCAGGC CCGCCCTGCT 
GACGTCCAGA CTCCGCTTCA TCCCCAAGCC TGACGGGCTG CGGCCGATTG 
TGAACATGGA CTACGTCGTG GGAGCCAGAA CGTTCCGCAG AGAAAAGAGG 
GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTTCAGCG TGCTCAACTA 
CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 
ACGATATCCA CAGGGCCTGG CGCACCTTCG TGCTGCGTGT GCGGGCCCAG 
GACCCGCCGC CTGAGCTGTA CTTTGTCAAG GTGGATGTGA CGGGCGCGTA 
CGACACCATC CCCCAGGACA GGCTCACGGA GGTCATCGCC AGCATCATCA 
AACCCCAGAA CACGTACTGC GTGCGTCGGT ATGCCGTGGT CCAGAAGGCC 
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GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGCCACGTCT CTACCTTGAC 
AGACCTCCAG CCGTACATGC GACAGTTCGT GGCTCACCTG CAGGANAACA 
GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC CCTGAATGAG 
GCCAGCAGTG GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCCACCACGC 
CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGCCAGGGG ATCCCGCAGG 
GCTCCATCCTCTCCACGCTG CTCTGCAGCC TGTGCTACGG CGACATGGAG 
AACAAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 
GGATGATTTC TTGTTGGTGA CACCTC ACCT C ACCCACGCG AAAACCTTCC 
TCAGGACCCT GGTCCGAGGT GTCCCTGAGT ATGGCTGCGT GGTGAACTTG 
CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC 
GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 
TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC CAGCTATGCC 
CGGACCTCCA TCAGAGCCAG TCTCACCTTC AACCGCGGCT TCAAGGCTGG 
GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 
GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGCACCAAC 
ATCTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT 
GCAGCTCCCA TTTCATCAGC AAGTTTGGAA GAACCCCACA TTTTTCCTGC 
GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT GAAAGCCAAG 
AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 
CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 
GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCCAG 
ACGCAGCTGA GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC 
CGCAGCCAAC CCGGCACTGC CCTCAGACTT CAAGACCATC CTGGACTGAT 
GGCCACCCGC CCACAGCCAG GCCGAGAGCA GACACCAGCA GCCCTGTCAC 
GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 
ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 
CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 
GGGCTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 
CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CACCAGGAGC CCGGCTTCCA 
CTCCCCACAT AGGAATAGTC CATCCCCAGA TTCGCCATTG TTCACCCCTC 
GCCCTGCCCT CCTTTGCCTT CCACCCCCAG CATCCAGGTG GAGACCCTGA 
GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG 
TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGTC AAATT 
GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT 
TTGAAAAAAA AAAAAAAAAA AAAAAAAAA 
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FIGURE 53 



met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro ara ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg clu val leu pro leu ala thr phe val arg arg leu 
CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly oro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGC- CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 , 60 

phe arg ala leu val ala gin cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg Dro pro pro ala ala pro ser phe arg gin val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 
leu lvs glu leu val ala arg val leu gin arg leu cys glu arg 
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

glv ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG A^C GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 



130 

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 
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140 

trp gly leu leu leu arg arg 
TGG GGG CTG CTG CTG CGC CGC 



leu leu ala arg cys ala leu 
CTG CTG GCA CGC TGC GCG CTC 

170 

ala tyr gin val cys gly pro 
GCC TAC CAG GTG TGC GGG CCG 



thr gin ala arg pro pro pro 
ACT CAG GCC CGG CCC CCG CCA 

200 

leu gly cys glu arg ala trp 
CTG GGA TGC GAA CGG GCC TGG 



val pro leu gly leu pro ala 
GTC CCC CTG GGC CTG CCA GCC 

.230 

ser ala ser arg ser leu pro 
AGT GCC AGC CGA AGT CTG CCG 



ala ala pro glu pro glu arg 
GCT GCC CCT GAG CCG GAG CGG 

260 

ala his pro gly arg thr arg 
GCC CAC CCG GGC AGG ACG CGT 



val val ser pro ala arg pro 
GTG GTG TCA CCT GCC AGA CCC 



150 

val gly asp asp val leu val his 
GTG GGC GAC GAC GTG CTG GTT CAC 

160 

phe val leu val ala pro ser cys 
TTT GTG CTG GTG GCT CCC AGC TGC 

180 

pro leu tyr gin leu gly ala ala 
CCG CTG TAG CAG CTC GGC GCT GCC 

190 

his ala ser gly pro arg arg arg 
CAC GCT AGT GGA CCC CGA AGG CGT 

210 

asn his ser val arg glu ala gly 
AAC CAT AGC GTC AGG GAG GCC GGG 

220 

pro gly ala arg arg arg gly gly 
CCG GGT GCG AGG AGG CGC GGG GGC 

240 

leu pro lys arg pro arg arg gly 
TTG CCC AAG AGG CCC AGG CGT GGC 

250 

thr pro val gly gin gly ser trp 
ACG CCC GTT GGG CAG GGG TCC TGG 

270 

gly pro ser asp arg gly phe cys 
GGA CCG AGT GAC CGT GGT TTC TGT 

280 

ala glu glu ala thr ser leu glu 
GCC GAA GAA GCC ACC TCT TTG GAG 
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290 300 
giv ala leu ser gly thr arg his ser his pro ser val gly arg 
GGT. GCG CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC 

310 

gin his his ala gly pro pro ser thr ser arg pro pro arg pro 
CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC 

320 330 
tiro asp thr pro cvs pro pro val tyr ala glu thr lys his phe 
TGG GAC ACG CCT TGT CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC 

340 

leu ty r ser ser gly asp lys glu gin leu arg pro ser phe leu 
CTC TAC TCC TCA GGC GAC AAG GAG CAG CTG CGG CCC TCC TTC CTA 

350 3 60 
leu ser ser leu arg pro ser leu thr gly ala arg arg leu val 
CTC AGC TCT CTG AGG CCC AGC CTG ACT GGC C-CT CGG AGG CTC GTG 

370 

glu thr ile phe leu gly ser arg pro trp met pro gly thr pro 
GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC 

380 390 
arg ara leu pro arg leu pro gin arg tyr trp gin met arg pro 
CGC AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC 

400 

leu phe leu glu leu leu gly asn his ala gin cys pro tyr gly 
CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG CAG TGC CCC TAC GGG 

410 420 
val leu leu lys thr his cys pro leu arg ala aia val thr pro 
GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA 

430 

ala ala glv val cys ala arg glu lys pro gin gly ser val ala 
GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC TCT GTG GCG 
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440 450 
ala pre glu glu glu asp thr asp pro arg arg leu val gin leu 
GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG CAG CTG 

460 

lo U arg gin his ser ser pro trp gin val tyr gly phe val arg 
CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG CGG 

470 ' 480 

ala cy* leu arg arg leu val pro pro gly leu trp gly ser arg 
GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 

490 

h^'s asn glu arcr arg phe leu arg asn thr lys lys phe ile ser 
CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC 

500 510 
leu gly ivs his ala lys leu ser leu gin glu leu thr trp lys 
CTG GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG 

520 

met ser val arg asp cys ala trp leu arg arg ser pro gly val 
ATG AGC GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT 

530 540 
gly cys val pro ala ala glu his arg leu arg glu glu ile leu 
GGC TGT GTT CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG 

550 

ala lys ohe leu his trp leu met ser val tyr val val glu leu 
GCC AAC- TTC CTG CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG 

560 570 
leu arg ser phe phe tyr val thr glu thr thr phe gin lys asn 
CTC AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC 

580 

arg leu phe phe tyr arg pro ser val trp ser lys leu gin ser 
AGG CTC TTT TTC TAC CGG CCG AGT GTC TGG AGC AAG TTG CAA AGC 

590 600 
ile glv ile arg gin his leu lys arg val gin leu arg glu leu 
ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG 
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610 

Qer oi» ala glu val arg gin his arg glu ala arg pro ala leu 
TCG GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG 

620 630 
leu thr ser arg leu arg phe ile pro lys pro asp gly leu arg 
CTG ACG TCC AGA CTC CGC TTC ATC CCC AAG OCT GAC GGG CTG CGG 

640 

pro iie val asn met asp tyr val val gly ala arg thr phe arg 
CCG ATT C-TG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC 

650 660 
arg glu Ivs arg ala glu arg leu thr ser arg val lys ala leu 
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 

670 

phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu 
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG 

680 690 
gly ala ser val leu glv leu asp asp ile his arg ala trp arg 
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 

700 

thr phe val leu arg val arg ala gin asp pro pro pro glu leu 
ACC TTC C-TG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG 

710 720 
tvr phe val lvs val asp val thr gly ala tyr asp thr ile pro 
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 

730 

gin asp arg leu thr glu val ile ala ser ile ile lys pro gin 
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG 

740 750 
asn thr tyr cys val arg arg tyr ala val val gin lys ala ala 
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC 

760 

his gly his val arg lys ala phe lys ser his val ser thr leu 
CAT GGG CAC GTC CGC AAG GCC TTC A*G AGC CAC GTC TCT ACC TTG 
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770 7 80 
thr 'aso leu gin pro tyr met arg gin phe val ala his leu gin 
ACA GAC CTC CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG 

790 

glu thr ser pro leu arg asp ala val val ile glu gin ser ser 
GAG ACC AGC CCG CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC 

SOO 810 
ser leu asn glu ala ser ser gly leu phe asp val phe leu arg 
TCC CTG AAT GAG GCC AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC 

820 

Dhe met cys his his ala val arg ile arg gly lys ser tyr val 
TTC ATG TGC CAC CAC GCC GTG CGC ATC AGG GGC AAG TCC TAC GTC 

830 840 
gin cys oln gly ile pro gin gly ser ile leu ser thr leu leu 
CAG TGC CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC 

850 

cys ser leu cys tyr gly asp met glu asn lys leu phe ala gly 
TGC AGC CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG 

860 870 
ile arg arg asp gly leu leu leu arg leu val asp asp phe leu 
ATT CGC- CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG 

880 

leu val thr pro his leu thr his ala lys thr phe leu arg thr 
TTG GTG ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC 

890 900 
leu val arg gly val pro glu tyr gly cys val val asn leu arg 
CTG GTC CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG 

910 

lys thr val val asn phe pro val glu asp glu ala leu gly gly 
AAG ACA GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC 

920 930 
thr ala phe val crln met pro ala his gly leu phe pro trp cys 
ACG GCT TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC 
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940 

giy leu leu ieu asp thr arg thr leu glu val gin ser asp tyr 
GGC CTG CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC 

950 960 
ser ser :vr ala arg thr ser ile arg ala ser val thr phe asn 
TCC AGC TAT GCC CGG ACC TCC ATC AGA GCC AGT GTC ACC TTC AAC 

970 

arg gly one lys ala gly arg asn met arg arg lys leu phe gly 
CGC GGC TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG 

980 590 
val ieu arc leu lvs cys his ser leu phe leu asp ieu gin val 
GTC TTG CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG 

1000 

asn ser leu gin thr val cys thr asn ile tyr lys ile leu leu 
AAC AGC CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG 

1010 1020 
leu gin ala tvr arg phe his ala cys val leu gin leu pro phe 
CTG CAC- GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT 

1030 

his a In gin val trp lvs asn pro thr phe phe ieu arg val ile 
CAT CAG CAA GTT TGG AAG AAC CCC ACA TTT TTC CTG CGC GTC ATC 

1040 1050 
ser aso thr ala ser leu cys tyr ser ile leu lys ala lys asn 
TCT GAC ACG GCC TCC CTC TGC TAC TCC ATC CTG AAA GCC AAG AAC 

1060 

ala glv met ser leu gly ala lys gly ala ala gly pro leu pro 
GCA GGG ATG TCG CTG GGG GCC AAG GGC GCC GCC GGC CCT CTG CCC 

1070 1080 
ser glu ala val gin trp leu cys his gin ala phe leu leu lys 
TCC GAG GCC GTG CAG TGG CTG TGC CAC CAA GCA TTC CTG CTC AAG 

1090 

leu thr arg his arg val thr tyr val pro leu leu gly ser leu 
CTG ACT CGA CAC CGT GTC ACC TAC GTG CCA CTC CTG GGG TCA CTC 
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arg thr ala crln thr gin leu ser arg lys leu pro gly thr thr 
AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG 

1120 

leu th- ala leu alu ala ala ala asn pro ala leu pro ser asp 
CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC 

1130 1132 
phe ivs thr lie leu asp OP 

TTC AAG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA 

CACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC 

AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 

GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC 

r^GCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 

CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 

CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 

GACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA 

CACAC^CGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGC 

TGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGRAAAAAAAAAAAAAAAAAA 

AAAAAAAAAA 
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